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SAR86 denotes a 16S clade of gammaproteobacteria that are ubiquitous in ocean surface waters. So far, SAR86 is resistant to cul- 
tivation; thus, little is known about the genome contents or physiology of this clade. Recently, four partial genome sequences for 
SAR86 subclades I and II were published. Here, we present the draft genome sequence of a single cell from SAR86 subgroup Ilia 
isolated from coastal waters in San Diego, CA. 
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The 16S clade SAR86 has been observed in the microbial com- 
munities of each of the major oceans (1-5). Multiple sub- 
groups are present within the SAR86 16S phylogeny, including the 
subgroups I, II, Ilia, and Illb, though it is not known whether 
these are related to differences in ecophysiology (6). The sequenc- 
ing of an ~40-kbp bacterial artificial chromosome (BAG) and 
fosmid clones generated from a variety of locations revealed that 
some SAR86 genomes contain a gene that encodes a rhodopsin 
capable of generating a proton motive force (PMF) using light 
energy (proteorhodopsin [7-9]), with divergences between the 
subclades. Recently, two near-complete genome sequences from 
subclades I and II were assembled from metagenomic data and 
provided insight into the metabolic potential of SAR86 (10). The 
two subclades displayed distinct environmental distributions re- 
lated to temperature, suggesting an ecophysiological Hnk to a phy- 
logenetic divergence, though representatives from other clades are 
necessary to make this conclusion. 

Seawater was collected at the Scripps Institution of Oceanog- 
raphy (SIO) pier on 9 April 2009 at 8:30 a.m. The water tempera- 
ture was 13.5°C, with a salinity of 33.5 practical salinity units 
(PSUs) and a chlorophyll fluorescence of 2.7 /i,g liter" ^ (SIO Au- 
tomated Shore station equipment, http://www.sccoos.org). Single 
cells were sorted and amplified using multiple displacement am- 
plification (MDA) technology as described by Dupont et al. (10). 

The MDA reaction mixture was sequenced on one lane of an 
lUumina GAIIx. The reads were trimmed with Trimmomatic, and 
iterative de novo assemblies were performed using the CLC 
Genomics workbench. The read set was normalized to reduce cov- 
erage biases caused by MDA. To aid with gap closure, high- 
identity (>95% nucleotide identity) paired-end metagenomic 
reads from the Global Ocean Sampling Project (11) were aligned 
to the contigs to help orient and order them. Gap closure was 
performed using PGR and Sanger sequencing. Annotation was 
performed using the Prokaryotic Annotation Pipeline (12) with 
visualization and manual curation in Manatee. We have assem- 



bled a genome of 7 scaffolds of 13 contigs totaling 1.40 Mbp, 1,442 
putative coding sequences, 33 tRNAs, and 1 rRNA operon. 

A phylogenetic analysis of the full-length 16S gene placed the 
single cell in SAR86 subclade Ilia, along with two BAGs (accession 
no. AY033328 and AF268217); thus, we have named the genome 
SAR86_IIIb_sincell 1 . The genome contains many of the same fea- 
tures observed by Dupont et al. (10), including an overabundance 
of tonS receptors, a complete Emden-Meyerhof-Parnas glycolytic 
pathway, a green-light-absorbing proteorhodopsin, and the 
molecular machinery for dimethylsulfoniopropionate (DMSP) 
and glutathione assimilation. Like the other genomes, it also lacks 
several synthetic pathways, including those for vitamins and the 
carotenoids needed for light harvesting. Uniquely, SAR86_ 
IIIb_sincelll contains a hypervariable region with poor coverage 
in ocean metagenomes. This hypervariable region contains the 
genes involved in outer membrane modification, including those 
for putative glycosyltransferases, fucose acetylases, GDP-mannose 
dehydratases, and acetyltransferases. A similar scenario has been 
observed in the genomic islands of marine SynecJiococcus strains 
WH8102 and GG9311, where it has been suggested to alter the 
interactions with predators and phages (13, 14). 

Nucleotide sequence accession number. This whole-genome 
shotgun project was deposited in NGBI under the accession no. 
AMWXOOOOOOOO (BioProject 170317). 
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